AITopics | deep linear network

Neural networks are known to develop latent representations that are aligned, namely structurally similar across networks trained with different architectures, training protocols, or training datasets. We study this phenomenon in a controlled setting, where we train an ensemble of networks on regression and classification tasks using training sets perturbed by independent realizations of a noise process. We show that the signal-to-noise ratio (SNR) and the training sample size influence the alignment in qualitatively similar ways in networks trained on real-world datasets and in an extremely simple linear network with a single hidden layer, for which the alignment can be estimated analytically. Across linear and nonlinear networks, regression and classification tasks, and both synthetic and real-world data, we consistently observe that alignment varies monotonically with SNR but non-monotonically with training sample size. In particular, the alignment is minimized near the interpolation threshold, and a stronger alignment does not necessarily correspond to better generalization error. These findings reveal a non-trivial dependence of alignment on data quality and quantity, decoupled from generalization performance.
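The controlled setting in the abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the noise is added to the inputs, alignment is measured with linear CKA on held-out inputs, and the names `train_linear_net`, `linear_cka`, and `alignment` are hypothetical.

```python
import numpy as np

def linear_cka(A, B):
    """Linear CKA between two representation matrices (samples x features)."""
    A = A - A.mean(axis=0)
    B = B - B.mean(axis=0)
    cross = np.linalg.norm(A.T @ B, "fro") ** 2
    return cross / (np.linalg.norm(A.T @ A, "fro") * np.linalg.norm(B.T @ B, "fro"))

def train_linear_net(X, y, hidden=8, lr=0.05, steps=2000, seed=0):
    """Full-batch gradient descent on 0.5 * mean squared error for y ~ X @ W1 @ w2."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(scale=0.1, size=(d, hidden))
    w2 = rng.normal(scale=0.1, size=hidden)
    for _ in range(steps):
        h = X @ W1                            # hidden representation
        err = h @ w2 - y                      # residuals
        grad_w2 = h.T @ err / n
        grad_W1 = X.T @ np.outer(err, w2) / n
        W1 -= lr * grad_W1
        w2 -= lr * grad_w2
    return W1, w2

# Synthetic regression task (assumed): Gaussian inputs, linear teacher.
rng = np.random.default_rng(0)
d, n = 10, 200
teacher = rng.normal(size=d) / np.sqrt(d)
X_clean = rng.normal(size=(n, d))
y = X_clean @ teacher
X_test = rng.normal(size=(500, d))

def alignment(noise_std):
    """CKA between hidden layers of two networks trained on independently perturbed data."""
    reps = []
    for k in range(2):
        noise = np.random.default_rng(100 + k).normal(scale=noise_std, size=X_clean.shape)
        W1, _ = train_linear_net(X_clean + noise, y, seed=k)
        reps.append(X_test @ W1)              # representations on shared held-out inputs
    return linear_cka(reps[0], reps[1])
```

Sweeping `noise_std` (a stand-in for SNR) or `n` in this toy would show how the proxy varies; it is not claimed to reproduce the paper's analytical estimate or its curves.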

artificial intelligence, machine learning, representation, (18 more...)

arXiv.org Machine Learning

2605.26973

Country: Europe > Italy (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

2b3bb2c95195130977a51b3bb251c40a-Paper-Conference.pdf

Neural Information Processing Systems | Apr-25-2026, 06:15:05 GMT

artificial intelligence, machine learning, representation, (13 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
North America > Canada (0.28)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.46)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Faster Directional Convergence of Linear Neural Networks under Spherically Symmetric Data

Neural Information Processing Systems | Apr-25-2026, 04:00:28 GMT

In this paper, we study gradient methods for training deep linear neural networks with binary cross-entropy loss. In particular, we show global directional convergence guarantees from a polynomial rate to a linear rate for (deep) linear networks with a spherically symmetric data distribution, which can be viewed as a specific zero-margin dataset. Our results do not require the assumptions made in other works, such as small initial loss, presumed convergence of the weight direction, or overparameterization. We also characterize our findings in experiments.
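What directional convergence looks like in practice can be shown with a small sketch. It assumes details the abstract does not give: isotropic Gaussian inputs stand in for the spherically symmetric distribution, labels are the sign of a projection onto a hypothetical `teacher` direction, and the network has two layers. The function name `direction_cosine_after_gd` is likewise hypothetical, and this is not the paper's algorithm or analysis.

```python
import numpy as np

def direction_cosine_after_gd(n=1000, d=5, hidden=4, lr=0.1, steps=2000, seed=0):
    """Train a two-layer linear network with logistic loss; return cos(angle) to the teacher."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, d))        # isotropic Gaussian inputs are spherically symmetric
    teacher = np.zeros(d)
    teacher[0] = 1.0                   # assumed labeling direction (hypothetical)
    y = np.sign(X @ teacher)           # labels in {-1, +1}

    W1 = rng.normal(scale=0.1, size=(d, hidden))
    w2 = rng.normal(scale=0.1, size=hidden)
    for _ in range(steps):
        w_eff = W1 @ w2                            # the network computes x -> x @ W1 @ w2
        margins = y * (X @ w_eff)
        s = 0.5 * (1.0 - np.tanh(0.5 * margins))   # sigmoid(-margin), numerically stable
        g = -(X.T @ (s * y)) / n                   # gradient of the mean loss w.r.t. w_eff
        grad_W1 = np.outer(g, w2)
        grad_w2 = W1.T @ g
        W1 -= lr * grad_W1
        w2 -= lr * grad_w2

    w_eff = W1 @ w2
    return float(w_eff @ teacher / np.linalg.norm(w_eff))
```

Binary cross-entropy on {0, 1} labels is written here in its equivalent logistic-loss form on {-1, +1} labels. Recording the cosine at intermediate steps, rather than only at the end, would give an empirical view of the rate at which the direction settles.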

artificial intelligence, convergence, machine learning, (14 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

24ec8468b67314c2013d215b77034476-Paper.pdf

Neural Information Processing Systems | Apr-25-2026, 04:00:15 GMT

artificial intelligence, convergence, machine learning, (14 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.31)

1baff70e2669e8376347efd3a874a341-Supplemental.pdf

Neural Information Processing Systems | Apr-24-2026, 23:10:40 GMT

artificial intelligence, derivation, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

1baff70e2669e8376347efd3a874a341-Paper.pdf

Neural Information Processing Systems | Apr-24-2026, 23:10:37 GMT

artificial intelligence, linear network, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States > New York (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network

Neural Information Processing Systems | Feb-16-2026, 16:48:15 GMT

Unfortunately, even for standard linear networks in the regression setting, a comprehensive characterization of the implicit bias is still an open problem.

artificial intelligence, linear network, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology: